Using Perspective Schemata to Model the ETL Process
Authors
Abstract
Data Warehouses (DWs) are repositories that hold the unified history of an enterprise for decision support. Data must be Extracted from information sources, Transformed and integrated, and then Loaded into the DW (ETL) using ETL tools. These tools focus on data movement and treat models only as a means to that end. From a conceptual viewpoint, the authors aim to improve the ETL process in two ways: 1) making the compatibility between models explicit in a declarative fashion, using correspondence assertions, and 2) identifying the instances in different sources that represent the same real-world entity. This paper presents an overview of the proposed framework for modeling the ETL process, which is based on a reference model and perspective schemata. The approach gives the designer a better understanding of the semantics associated with the ETL process.
Keywords: conceptual data model, correspondence assertions, data warehouse, data integration, ETL process, object relational database.
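The two ideas named in the abstract can be illustrated with a small sketch. It is not the paper's notation or framework: the class `CorrespondenceAssertion`, the helpers `to_reference` and `same_entity`, the source names `crm` and `billing`, and all attribute names are hypothetical, chosen only to show how declarative attribute correspondences against a reference model might be written down and then used to decide whether two source records describe the same real-world entity.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass(frozen=True)
class CorrespondenceAssertion:
    """Hypothetical declaration: a source attribute corresponds to a reference-model attribute."""
    reference_attr: str
    source: str
    source_attr: str
    convert: Callable[[object], object]  # normalizes the source value


def to_reference(record: dict, source: str, assertions: list) -> dict:
    """Rewrite one source record in reference-model terms using the assertions."""
    return {
        a.reference_attr: a.convert(record[a.source_attr])
        for a in assertions
        if a.source == source and a.source_attr in record
    }


def same_entity(a: dict, b: dict, key_attrs: tuple) -> bool:
    """Assumed matching rule: records match when all key attributes are present and equal."""
    return all(a.get(k) is not None and a.get(k) == b.get(k) for k in key_attrs)


# Illustrative assertions for two made-up sources.
ASSERTIONS = [
    CorrespondenceAssertion("customer_name", "crm", "full_name", str.strip),
    CorrespondenceAssertion("tax_id", "crm", "vat_no", lambda v: str(v).replace("-", "")),
    CorrespondenceAssertion("customer_name", "billing", "name", str.strip),
    CorrespondenceAssertion("tax_id", "billing", "tax_number", str),
]

crm_row = {"full_name": " Ada Lovelace ", "vat_no": "12-345"}
billing_row = {"name": "Ada Lovelace", "tax_number": 12345}

crm_ref = to_reference(crm_row, "crm", ASSERTIONS)
billing_ref = to_reference(billing_row, "billing", ASSERTIONS)
matched = same_entity(crm_ref, billing_ref, ("tax_id",))
```

Keeping the correspondences as data rather than burying them in transformation code is what makes them declarative here: the same list can be inspected, validated, or extended with a new source without touching the functions that apply it.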
Similar resources
Improving the Extract, Transform, and Load Process in Analytical Databases Using Parallel Processing
Data warehouses are used to store data in a structure that facilitates data analysis. The Extract, Transform, and Load (ETL) process covers retrieving the required data from the source systems and loading it into the data warehouse. Although the structures of the source data (e.g. ER model) and the DW (e.g. star schema) are usually specified, there is a clear lack of a ...
Quarry: Digging Up the Gems of Your Data Treasury
The design lifecycle of a data warehousing (DW) system is primarily led by the requirements of its end-users and the complexity of the underlying data sources. Designing a multidimensional (MD) schema and back-end extract-transform-load (ETL) processes is a long-term and mostly manual task. As enterprises shift to more real-time and 'on-the-fly' decision making, business intelligence (BI)...
Ontology Development for ETL Process Design
The Extract, Transform, Load (ETL) process design is difficult to perform because of the ambiguity of user requirements and the complexity of data integration and transformation. Current studies have explored the ontology-based approach to overcome these limitations by reconciling the semantics of user requirements within the ETL process design for easy generation of the ETL process specificati...
Co-evolution Model for Data Sources and Views
ETL process evolution is investigated below. A model-driven approach to templates and the ETL process evolution problem is developed. We suppose that ETL process evolution is mainly a problem of a low abstraction level, so defining the ETL process on the basis of a conceptual model is a principal step towards effective ETL evolution. Our approach seems to be scalable, robust and simpler i...
BPMN-Based Conceptual Modeling of ETL Processes
Business Intelligence (BI) solutions require the design and implementation of complex processes (denoted ETL) that extract, transform, and load data from the sources into a common repository. New applications, such as real-time data warehousing, require agile and flexible tools that allow BI users to make timely decisions based on extremely up-to-date data. This calls for new ETL tools ...